Picture for Mingyang Li

Mingyang Li

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions

Add code
Jan 29, 2026
Viaarxiv icon

Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories

Add code
Jan 21, 2026
Viaarxiv icon

Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning

Add code
Jan 08, 2026
Viaarxiv icon

All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection via Design Concept Reproduction

Add code
Jan 08, 2026
Viaarxiv icon

SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing

Add code
Nov 14, 2025
Figure 1 for SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
Figure 2 for SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
Figure 3 for SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
Figure 4 for SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
Viaarxiv icon

MS2Edge: Towards Energy-Efficient and Crisp Edge Detection with Multi-Scale Residual Learning in SNNs

Add code
Nov 05, 2025
Viaarxiv icon

Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems

Add code
Jun 06, 2025
Figure 1 for Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Figure 2 for Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Figure 3 for Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Figure 4 for Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Viaarxiv icon

Universal Visuo-Tactile Video Understanding for Embodied Interaction

Add code
May 28, 2025
Figure 1 for Universal Visuo-Tactile Video Understanding for Embodied Interaction
Figure 2 for Universal Visuo-Tactile Video Understanding for Embodied Interaction
Figure 3 for Universal Visuo-Tactile Video Understanding for Embodied Interaction
Figure 4 for Universal Visuo-Tactile Video Understanding for Embodied Interaction
Viaarxiv icon

AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery

Add code
May 27, 2025
Viaarxiv icon

MPRM: A Markov Path-based Rule Miner for Efficient and Interpretable Knowledge Graph Reasoning

Add code
May 18, 2025
Viaarxiv icon